This is a download of most of the archiveofourown.org stories using the downloader at http://code.google.com/p/fanficdownloader/ , archived by category and status, with the filename format category - author - title.txt along with an inventory text file containing the full path of every file in the archive, along with the corresponding ao3 link. The archive itself is compressed using 7z's ultra settings, and expands to ~18gb It contains every valid id up to ~700000 (at least that far, possibly a... ( 1 reviews ) Topics: ao3, archive, own
Grabbed during the dying days of the FortuneCities website, this 1.0gb collection of screenshots of Fortunecities user pages allows for an easy representation of the experience and information of this site. After 14 years of existence, Fortunecities was shut down in 2012, a victim of changing priorities and financial crunch. The collection includes 3,436 screenshots in .PNG format, along with .info files indicating the original location of the site in text format, and the web header return from... Topics: FortuneCities, Fortunecity, Screenshots, websites, Archive Team
EtherPad was a web-based collaborative real-time editor, allowing authors to simultaneously edit a text document, and see all of the participants' edits in real-time, with the ability to display each author's text in their own color. Very popular and in use by educators, businesses, and developers, Etherpad gained a strong following, but was later purchased by Google. With the introduction of the competing Wave application, Google announced a shutdown of Etherpad in favor of Wave. To outcry,... Topics: etherpad, archiveteam, archive
This is a download of http://forum.nos.nl/, the online discussion forums of Dutch public broadcaster NOS. It contains messages posted in 2005, 2006 and 2007 by visitors of the NOS website. Discussion topics include news, politics and NOS programmes. Downloaded June 2011. -- The archive is available in several formats: * a copy of the HTML page of each topic (including every message posted on the forum) * an XML file for each topic, providing the messages extracted from the HTML * a wget... Topics: forum.nos.org, NOS, Forum, Archive
Two collections of Boing Boing postings provided by the cultural website boingboing.net on its 5th and 11th anniversaries. Includes the HTML/text aspects of the postings, along with various author and creation information. From the 2011 BoingBoing.net posting: "Having very recently celebrated Boing Boing's eleventh bloggaversary, we're releasing an update of our previous archival release of Boing Boing posts. "This time, we're releasing a 120.3MB XML file (38.3MB zip) of 63,999 posts... Topics: BoingBoing, posts archive, blogging
Ripped bY THE ARCHiVERS for our brothers Archive Team. Visit us: http://w4r3zh4ck.blogspot.com/ Topics: archive, team, archive team, AT, archiveteam, the archivers, archivers, site rip, rip, w4r3zh4ck,...
Before its relaunch as a gaming website, Friendster was a social networking website that allowed users to connect with their friends. One of the elements of the site were the groups that members could join. This dataset contains the group memberships of all Friendster groups. It is the result of an extensive crawl of Friendster.com at the end of June 2011. It was performed as part of the ArchiveTeam project to archive part of the Friendster data before the service relaunched. The data files... Topics: Friendster, Groups, Group Lists, Membership Lists, Archive Team
Two archives (WARC, tar/gzip) of the Partyvan mirror at dnathe4th.porfusion.com. WARC file begun downloading 8 Oct 2013. Archive by ET3RN4L Topics: mirror, ET3RN4L, partyvan, dnathe4th, porfusion, dnathe4th.porfusion.com, porfusion.com, archive,...
Before its relaunch as a gaming website, Friendster was a social networking website that allowed users to connect with their friends. The central element of the site was the 'friends list', showing the contacts of the user. This dataset contains the connections between all Friendster users. It is the result of an extensive crawl of Friendster.com at the end of June 2011. It was performed as part of the ArchiveTeam project to archive part of the Friendster data before the service relaunched. The... Topics: Friendster, Friends, Friend Lists, Membership Lists, Archive Team
This is a Heritrix crawl of http://llink.nl/, the website of Dutch public broadcasting association LLiNK, made on 23 and 24 June 2011. It includes the main website as well as the programme-specific websites of LLiNK radio and television programmes. The crawl logs and order file are available in llink-20110624-crawl-logs.tar.bz2 -- The MD5 checksums of the files are: 050b714c6df98a29bdb6c1ff077c6953 llink-20110623100606-00000.warc.gz 49de8110b71ba8da7607eafbfe80fd50... Topics: LLiNK, Dutch, Archive, Webgrab, NPO